A Machine Learning Approach for Identification Thesis and Conclusion Statements in Student Essays

نویسندگان

  • Jill Burstein
  • Daniel Marcu
چکیده

This study describes and evaluates two essay-based discourse analysis systems that identify thesis and conclusion statements from student essays written on six different essay topics. Essays used to train and evaluate the systems were annotated by two human judges, according to a discourse annotation protocol. Using a machine learning approach, a number of discourse-related features were automatically extracted from a set of annotated training data. Using these features, two discourse analysis models were built using C5.0 with boosting: a topic-dependent and a topicindependent model. Both systems outperformed a positional algorithm. While the topic-dependent system showed somewhat higher performance, the topic-independent system showed similar results, indicating that a system can generalize to unseen data – that is, essay responses on topics that the system has not seen in training.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identifying Thesis and Conclusion Statements in Student Essays to Scaffold Peer Review

Peer-reviewing is a recommended instructional technique to encourage good writing. Peer reviewers, however, may fail to identify key elements of an essay, such as thesis and conclusion statements, especially in high school writing. Our system identifies thesis and conclusion statements, or their absence, in students’ essays in order to scaffold reviewer reflection. We showed that computational ...

متن کامل

Identifying Thesis Statements in Student Essays: The Class Imbalance Challenge and Resolution

A thesis statement or controlling idea is a key component of the Common Core State Standards of writing from grade 6 to grade 12. We developed a machine learning model to identify thesis statements in students’ essays in order to focus peer-reviewers on commenting on the presence and quality of an author’s thesis statement. Identifying thesis statements in essays can be considered as a classifi...

متن کامل

Discourse Element Identification in Student Essays based on Global and Local Cohesion

We present a method of using cohesion to improve discourse element identification for sentences in student essays. New features for each sentence are derived by considering its relations to global and local cohesion, which are created by means of cohesive resources and subtopic coverage. In our experiments, we obtain significant improvements on identifying all discourse elements, especially of ...

متن کامل

Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays

automated feedback that helps them revise their work and ultimately improve their writing skills. These applications also address educational researchers’ interest in individualized instruction. Specifically, feedback that refers explicitly to students’own writing is more effective than general feedback.3 Our discourse analysis software, which is embedded in Criterion (www.etstechnologies.com),...

متن کامل

Learning to Identify Sentence Parallelism in Student Essays

Parallelism is an important rhetorical device. We propose a machine learning approach for automated sentence parallelism identification in student essays. We build an essay dataset with sentence level parallelism annotated. We derive features by combining generalized word alignment strategies and the alignment measures between word sequences. The experimental results show that sentence parallel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computers and the Humanities

دوره 37  شماره 

صفحات  -

تاریخ انتشار 2003